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Abstract 

Given a finite nonempty sequence of integers S, by grouping adjacent terms it is always possible 
to write it, possibly in many ways, as S = X Y k , where X and Y are sequences and Y is nonempty. 
Choose the version which maximizes the value of k: this k is the curling number of S. The curling 
number conjecture is that if one starts with any initial sequence S, and extends it by repeatedly 
appending the curling number of the current sequence, the sequence will eventually reach 1. The 
conjecture remains open, but we will report on some numerical results in the case when S consists 
of only 2's and 3's. 
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1. The curling number conjecture 

Let S be a finite nonempty sequence of integers. By grouping adjacent terms, it is always possible to 
write it as S = XYY ••• Y = X Y k , where X and Y are sequences of integers and Y is nonempty. 
There may be several ways to do this: choose the one that maximizes the value of k: this k is the 
curling number of S, denoted by k(S). 

For example, ifS = 012212212 2, we could write it as X Y 2 , where X = 01221221 and 
Y = 2, or as X y 3 , where X = and Y = 12 2. The latter representation is to be preferred, since 
it has k = 3, and as k = 4 is impossible, the curling number of this S is 3. 

The curling number conjecture is that if one starts with any initial sequence S, and extends it 
by repeatedly appending the curling number of the current sequence, the sequence will eventually 
reach 1. In other words, if So = S is any finite nonempty sequence of integers, and we define 

S n+1 =S n k(S n ) forn>0, (1) 

then the conjecture is that for some m > we will have k(S m ) = 1. 

For example, suppose we start with So = 2 3 2 3. By taking X = 0, Y = 2 3, we have So = Y 2 , 
so k(S ) = 2, and we get Si = 2 3 2 3 2. By taking X = 2, Y = 3 2 we get fc(Si) = 2, S 2 = 2 3 2 3 2 2. 
By taking X = 2 3 2 3, Y = 2 we get k(S 2 ) = 2, S 3 = 2 3 2 3 2 2 2. Again taking A = 232 3, Y = 2 



we get A; (S3) = 3, S4 = 2323222 3. Now, unfortunately, it is impossible to write S4 = X Y k with 
k > 1, so k(S4) = 1, S5 = 232322231, and we have reached a 1, as predicted by the conjecture. 
(If we continue the sequence from this point, it appears to join Gijswijt's sequence, discussed in 
Section [5J) 

The conjecture is stated in PQ, [3], [I], and is mentioned in several entries in [2]. Some of the 
proofs in [I] could be shortened if the conjecture were known to be true. All the available evidence 
suggests that the conjecture is true, but it has so far resisted all attempts to prove it. 

Notation. Y k means Y Y ■ ■ ■ Y , where Y is repeated k times. denotes the empty sequence. 
We usually separate the parts of a sequence by small spaces. 

2. Initial sequences of 2's and 3's 

One way to approach the problem is to consider initial sequences So that contain only 2's and 3's, 
and to see how far such a sequence will extend using the rule before reaching a 1. 

Let /x(n) denote the maximal length that can be achieved before a 1 appears, for any starting 
sequence consisting of n 2's and 3's. The Curling Number Conjecture implies that fJ.(n) < 00 for 
all n. Reference pQ gave [i{n) for 1 < n < 30. Since then we have computed the values of n(n) for 
all n < 52, and we have lower bounds, which are probably the exact values, for 53 < n < 80. The 
results are shown in Table [l] and Figure [Tl The values of /i(n) also form sequence A094004 in [2]. 
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Table 1: n(n), the record for a starting sequence of n 2's and 3's. Entries marked with a dagger 
(f) are only lower bounds but are conjectured to be exact. 

As can be seen from Fig. [TJ up to n = 52, fi(n) increases in a piecewise linear manner. At 
the values n = 1, 2, 4, 6, 8, 9, 10, 11, 14, 19, 22, 48 (entry A160766 in [2J) there is a jump, but at the 
other values of n up to 52, /x(n) is simply fi(n — 1) + 1. Table [2] gives the starting sequences where 
/j,(n) > fi(n — 1) + 1. Up to n = 52, such sequences are always unique and start with a 2 (except 
for n = 1). 

Note from Table [T] that 

M (n) = n + 120 for 22 < n < 47 . (2) 
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Figure 1: Known values of ;u(n), the maximal length of a sequence produced by any starting 
sequence of n 2's and 3's, stopping when first 1 is reached. The values for 53 < n < 80 are only 
lower bounds, but are conjectured to be exact. 

That is, in this range one cannot do any better than taking the starting sequence for n = 22 and 
prefixing it by an irrelevant sequence of 47 — n 2's and 3's. However, at n = 48 a new record-holder 
appears, and we have 

M (n) = n + 131 for 48 < n < 52 . (3) 

The main reason we were able to extend the search as far as n = 52 is the following lemma. 

Lemma 1. If fi(n) > fi(n — 1) + 1 then no starting sequence of n 2's and 3's that achieves n(n) 
can contain a subsequence W 4 for a nonempty sequence W. 

Proof. Let S be a starting sequence of length n that achieves fi(n) > fi(n — 1) + 1, say S = 
sis 2 ... s n , and let T = si s 2 . . . s n . . . s m(7i) (with Sj G {2, 3} for 1 < i < fi(n), s M („)+i {2, 3}) 
be its extension. Call S "weak" if s\ is never used in T, that is, if Si s 2 ... s r can be written as 
XY Sr+1 , X ^ 0, Y ^ 0, for all r satisfying n <r < /x(n) - 1. If S is weak, then fi(n) < /x(n - 1) + 1, 
since T is simply si followed by the extension of s 2 ■ ■ ■ s n . Hence S is not weak, and so for some 
r with n < r < /j,(n) — 1 we have s± s 2 . . . s r = Y k , where k = s r+ i > 2 and Y is a sequence that 
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Table 2: Starting sequences of n 2's and 3's for which fj.(n) > ^(n — 1) + 1, complete for 1 < n < 52. 

begins with S. If S were to contain a sequence of the form W 4 , we would have a contradiction 
unless W 4 were at the end of Y, for when growing the second copy of Y , after W 4 has appeared, 
the next term is > 4, whereas S only contains 2's and 3's. On the other hand, if W is at the end 
of Y, the next term is > 4, so k = 1, again a contradiction. ■ 

We also made use of a number of more obvious shortcuts, such as not considering a starting 
sequence s\ . . . s n if k(s\ . . . s n _i) = s n , since we may assume that we have already considered all 
starting sequences of length n — 1 . 

The lemma cuts down the number of starting sequences of 2's and 3's that must be considered. 
Even if we simply exclude sequences that contain four consecutive 2's or four consecutive 3's, the 
number of length n (see entry A135491 in [2]) drops from 2 n to constant-a™, where a = 1.839 . . .. 

Inspection of the best starting sequences in Table [2] suggests they must satisfy another condition, 
which however we have not been able to prove: namely that they do not contain two consecutive 
3's. This is true for all the best starting sequences of lengths n < 52. 

Making the assumption (as yet unjustified) that we need only consider starting sequences with 
at most three consecutive 2's and with no pair of adjacent 3's, we were able extend the search to 
n = 78. This produced three further jumps, at n = 68, 76 and 77, establishing that fj>(n) > n + 132 
for 68 < n < 75, fi(n) > n + 133 for n = 76 and n(ri) > n + 173 for 77 < n < 80. The corresponding 
starting sequences are shown in Table |3j 

n Starting sequence 

68 223223222322323222322232232223223222322323222322 
23223222322322232232 

76 232223223222322323222322232232232223223222322323 
2223222322322322232232223223 

77 223222323222322232232223222323222322232232223222 
32322232223223232223223222323 



Table 3: Starting sequences of n 2's and 3's for which /x(n) > fx(n — 1) + 1, conjectured to be 
complete for 53 < n < 80. 

We have not succeeded in finding any algebraic constructions for good starting sequences. 
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However, one simple construction enables us to obtain lower bounds on n(n) for some larger values 
of n. Let S be a sequence of length n that achieves fJ,(n), and let T be the sequence of length fi(n) 
that it generates (up to just before the first 1 appears). Then in some cases the starting sequence 
T S will extend to TT 2 and beyond before reaching a 1. For example, taking S to be the length-48 
sequence in Table [2| the sequence T S has length 179 + 48 = 227 and extends to a total length of 
596 before reaching a 1, showing that /x(227) > 596. 

It would be nice to have some further exact values of /i(n), even though they will require 
extensive computations. Can the especially good starting sequences shown in Tables [2] and [3] 
(especially at lengths 22, 48 and 77) be generalized? What makes them so special? And above all, 
what is the asymptotic behavior of n(n)? 

3. Gijswijt's sequence 

If we simply start with Sq = 1, and generate an infinite sequence by continually appending the 
curling number of the current sequence, as in ([!]), we obtain 

1,1,2,1,1,2,2,2,3,1,1,2,1,1,2,2,2,3,2,1,1,2,1,1,2,2,2,3,1,1,2,1,1,... . 

This is Gijswijt's sequence, invented by Dion Gijswijt when he was a graduate student at the 
University of Amsterdam, and analyzed in [I]. It is entry A090822 in [2]. 

The first time a 4 appears is at term 220. One can calculate quite a few million terms without 
finding a 5 (as the authors of [Ij discovered!), but in pQ it was shown that a 5 eventually appears 
for the first time at about term 

io 1023 . 

Reference [T| also shows that the sequence is in fact unbounded, and conjectures that the first time 
that a number m > 6 appears is at about term number 

m— 1 




a tower of height m — 1. 

There is another question, also still open, which relates Gijswijt's sequence to the discussion 
in the previous section. If we start with an initial sequence S of 2's and 3's, extend it until we 
reach the first 1, say at the (k + l)st step, and then keep going, it appears that the result is always 
simply the first k terms of the extension of S, followed by Gijswijt's sequence. In other words, there 
is never any interaction between the first k terms of the extension of S and an initial segment of 
Gijswijt's sequence when computing curling numbers after the kth step. This seems plausible, but 
we would not be surprised if there was a counterexample. It would be nice to have this question 
settled one way or the other. 

One final remark: To avoid l's in the sequence, we might define h(S) = m&x{k(S), 2}, and 
replace the recurrence ([I]) by 

5 n+ i = S n h(S n ) for n > , (4) 

If we start with Sq = 1 and use the rule Q to extend it, the resulting sequence (A091787) is again 
unbounded, and now it is possible to compute exactly when the first 5 appears, which is at step 

77709404388415370160829246932345692180. 

See [lj for further information. 
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